CDS
Accession Number | TCMCG075C09932 |
gbkey | CDS |
Protein Id | XP_017972869.1 |
Location | join(27810218..27810540,27811095..27811241,27811410..27811563,27812216..27812351,27812781..27812926,27813596..27813691,27813854..27813906,27814958..27815089,27815773..27815824,27815929..27816048,27816357..27816491,27817015..27817143,27817623..27817700,27817905..27818045,27818313..27818407,27818904..27819100,27819269..27820056,27820434..27820514,27820601..27820666,27820763..27820840,27821540..27821770,27822143..27822274,27822389..27822565,27834284..27834358) |
Gene | LOC18605417 |
GeneID | 18605417 |
Organism | Theobroma cacao |
Protein
Length | 1253 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018117380.1 |
Definition | PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X5 [Theobroma cacao] |
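The Location field lists the exon segments of the CDS as a `join(...)` of 1-based, inclusive coordinate ranges. As a minimal sketch of how that field relates to the protein Length, the snippet below sums the segment lengths and checks the total against 1253 residues plus one stop codon. The helper names `parse_join` and `spliced_length` are hypothetical, introduced only for this illustration, and the parser assumes a plain `join` without complement or partial-end markers.

```python
import re

# Location string copied from the record's Location field.
LOCATION = (
    "join(27810218..27810540,27811095..27811241,27811410..27811563,"
    "27812216..27812351,27812781..27812926,27813596..27813691,"
    "27813854..27813906,27814958..27815089,27815773..27815824,"
    "27815929..27816048,27816357..27816491,27817015..27817143,"
    "27817623..27817700,27817905..27818045,27818313..27818407,"
    "27818904..27819100,27819269..27820056,27820434..27820514,"
    "27820601..27820666,27820763..27820840,27821540..27821770,"
    "27822143..27822274,27822389..27822565,27834284..27834358)"
)


def parse_join(location: str) -> list[tuple[int, int]]:
    """Hypothetical helper: extract (start, end) pairs from a simple join(...)."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments: list[tuple[int, int]]) -> int:
    """Coordinates are inclusive, so each segment spans end - start + 1 bases."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
cds_length = spliced_length(segments)
protein_length = 1253  # from the Length field of this record

print(len(segments), cds_length)               # 24 3762
# Three nucleotides per residue, plus one terminal stop codon.
print(cds_length == 3 * (protein_length + 1))  # True
```

The 24 segments sum to 3762 nt, which is consistent with a 1253-residue protein followed by a single stop codon.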
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGAGCTACGCGGCCTACAAGATGATGCACTGGCCCACCGGAATCGATAACTGCGCCTCCGGCTTCGTCACCCACTGCCGTGCGGATTTCACGCCTCAAATTCCGCTCAACCAGACTGAAGATCTTGAATCTGAATGGCCCGCAAGGCGCGGCATTGGTCCCGTTCCGAACCTCATCGTCACCGCTGCTAACCTTCTCGAAATCTATGTGGTCAGGGTTCAAGAAGAGGGCAGAAGGGAAGCCAGAAACTCCACTGAAGTCAAGCGCGGCGGGGTTTTGGATGGGGTATCCAGGGTTTCTCTCGAGCTCGTTTGCAATTATAGGTTACATGGTAATGTTGAATCTATGGCGGTACTATCTATAGGAGGTGGTGATGGCTCTAGGAGGAGAGATTCAATTATCTTAGCCTTTCAAGATGCCAAAATTTCGGTCCTGGAGTTTGATGATTCCATCCATGGTCTTCGAACAACCTCAATGCATTGCTTTGAGGGCCCGGAGTGGCTTCATTTGAAAAGAGGAAGAGAATCATTTGCTAGAGGGCCACTGGTAAAGGTTGATCCTCAAGGCAGGTGTGGCGGCGTTCTTGTTTATGATTTGCAAATGATAATACTTAAAGCTTCTCAGGCTGGTTCTGGATTTGTGGGAGAGGATGATGCTTTTGGATCTGGAGGTGCAGTTTCTGCTCGTGTTGAGTCATCTTACATTATCAATTTACGAGATTTGGACGTGAAGCACATTAAAGATTTTATATTTGTGCATGGGTATATTGAGCCGGTGATGGTTATCCTTCATGAGCGGGAGCTTACTTGGGCTGGGCGAGTCTCTTGGAAGCACCATACTTGCATGATTTCTGCACTTAGTATTAGCACAACCTTGAAGCAGCATCCTCTCATATGGTCAGCAGTTAATCTTCCTCATGATGCTTACAAGCTGCTTGCAGTTCCGTCGCCAATTGGAGGTGTTCTTGTGATTAGTGCAAATACTATTCATTATCATAGTCAGTCGGCTTCATGCGCACTTGCTTTGAACAATTATGCTATTTCTGTTGATAACAGTCAAGACCTTCCAAGATCAAATTTCAGTGTAGAACTTGATGCTGCTAATGCAACTTGGTTACTAAATGATGTAGCCTTGCTATCAACAAAGACTGGAGAACTGTTATTGCTGACCCTTATTTATGATGGGCGGGTTGTGCAGAGACTTGATCTTTCCAAGTCCAAGGCTTCAGTACTTACTTCGGACATTACAACTATTGGAAATTCATTGTTCTTTTTGGGTAGTCGATTGGGAGATAGTTTGCTTGTGCAATTCAGTGGTGGATCAGGAGCGTCAGCCTTGCCATCTGGTTTGAAGGAAGAGGTTGGAGATATTGAAGGTGATGTCCCTCTGGCAAAGCGATTGCGAAGGTCATCTTCTGATGCTTTGCAAGATATGGTTGGCGGTGAAGAGCTTTCTTTGTATGGTTCGGCCCCAAATAACACTGAGTCAGCACAGAAGACTTTCTTGTTTGCAGTGAGAGACTCATTAACTAATGTTGGCCCTTTGAAGGACTTCTCATATGGCTTGAGGATTAATGCTGATGTGAATGCAACTGGAATTGCCAAACAAAGTAATTATGAGCTGGTGTGCTGTTCTGGCCATGGAAAGAATGGTGCCCTCTGTGTTCTACGACAGTCAATTCGTCCTGAAATGATTACCGAGGTTGAACTAACTGGTTGTAAAGGAATTTGGACTGTCTACCACAAGAGCACACGCAGTCACAGTGCTGATTTGTCTAAAGTGACTGATGATGATGATGAATATCATGCATATTTGATTATAAGTCTGGAGGCGCGCACCATGGTGCTTGAAACAGCTGATCTTTTGACAGAAGTGACTGAAAGTGTAGACTATTATGTTCAAGGAAGAACAATTGCTGCAGGAAATTTGTTTGGAAGGCGTCGAGTTGTCCAGGTCTATGAACGTGGTGCTCGAATTCTGGATGGTTCTTTTATG
ACTCAAGAACTGAGTATTCCATCACCAAACTCTGAATCTAGCCCTGGTTCTGAGAATTCTACAGTAATATCTGTTTCTATTGCTGATCCTTATGTGTTGCTAAGAATGACTGATGGAAGCATTCTTCTCCTTGTTGGAGATCCTGCTACTTGCACTGTTTCTATAAACACTCCAACTGCATTTGAAGGCTCAAAGAAAATGGTATCTGCCTGTACATTGTATCATGATAAAGGTCCAGAGCCATGGCTCCGCAAAGCAAGTACTGATGCGTGGCTTTCCACTGGCGTCGGGGAGTCCATTGACGGTGCTGATGGTGGGCCACATGATCAAGGGGATATATATTGTGTCGTTTGTTATGAGAGTGGTGCTCTTGAAATATTTGATGTGCCAAATTTCAATTGTGTTTTCTCTATGGAAAATTTTTCATCTGGAAGAACCCGCCTTGTTGATGCCTATACACTGGAATCTTCTAAGGATTCTGAGAAAGTGATTAATAAAAGTTCTGAAGAATTGACTGGCCAAGGCAGGAAAGAAAATGTTCAAAACCTGAAGGTTGTTGAGTTGGCCATGCAGAGATGGTCTGCAAATCACAGTCGTCCATTTCTTTTTGGAATATTAACGGATGGAACAATTCTTTGTTATCATGCTTACCTATTTGAAGGTTCAGAAAATGCTTCTAAAGTTGAGGATTCAGTTGTTGCACAAAATTCTGTTGGCTTAAGCAATATTAATGCTTCTAGGCTTAGGAATTTGAGATTTATTCGCATCCCGTTGGATGCTTACACGAGGGAGGAGATGTCAAATGGAACCTTATCCCAAAGGATTACAATTTTTAAGAATATTAGTGGTTATCAAGGGTTCTTCCTCTCTGGTTCAAGACCAGCTTGGTTTATGGTATTCAGAGAACGGCTTCGAGTTCATCCACAGCTATGTGATGGATCTATTGTTGCTTTCACTGTTCTTCATAATGTCAACTGTAATCATGGGTTCATATATGTTACATCGCAGGGTATTCTAAAGATTTGCCAAATCCCATCTGCATCAAACTATGACAACTATTGGCCAGTGCAAAAAATTCCACTAAGGGGCACTCCACATCAAGTGACTTACTTTGCTGAGAGGAATCTTTACCCAATTATAGTTTCAGTTCCTGTTCATAAGCCAGTTAATCAAGTGCTATCTTCATTGGTTGATCAAGAAGTTGGCCATCAGATGGACAATCATAATTTGAGTTCTGATGAGTTGCAACGAACTTATACAGTGGATGAGTTCGAGGTTCGGATTTTGGAACCTGAAAAATCTGGTGGTCCTTGGGAAACTAAGGCAACTATACCAATGCAGAGTTCTGAAAATGCTCTAACTGTGAGAGTGGTCACTCTGTTTAATACCACCACAAAAGAGAATGAATCCCTTTTGGCTATTGGGACAGCTTACATTCAAGGAGAGGATGTTGCTGCTAGAGGACGTGTGATTTTGTGTTCAATTGGAAGGAACACTGATAATCCTCAGAATTTGGTGTCAGAGGTTTATTCAAAGGAACTAAAAGGTGCTATATCTGCTTTAGCCTCCCTTCAAGGTCATCTATTGATAGCTTCTGGTCCGAAAATTATTCTACATAATTGGACTGGTAGTGAGCTGAATGGCATTGCATTTTATGATGCTCCACCATTATATGTTGTGAGCTTAAATATAGTCAAGAATTTTATCCTTCTTGGTGATGTTCACAAGAGCATATACTTTTTAGTTGGAAGGAATAGGTTACTGTAG |
Protein: MSYAAYKMMHWPTGIDNCASGFVTHCRADFTPQIPLNQTEDLESEWPARRGIGPVPNLIVTAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSRVSLELVCNYRLHGNVESMAVLSIGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINLRDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRSNFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRLGDSLLVQFSGGSGASALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDMVGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDDDDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCTVSINTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGDIYCVVCYESGALEIFDVPNFNCVFSMENFSSGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYIQGEDVAARGRVILCSIGRNTDNPQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLVGRNRLL |
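The Protein line can be spot-checked against the CDS line by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record itself does not state, and translates only the first 30 and last 15 nucleotides of the CDS as copied from above. The `translate` function is a hypothetical helper written for this illustration, not part of any library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    nt = nt.upper()
    residues = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


cds_start = "ATGAGCTACGCGGCCTACAAGATGATGCAC"  # first 30 nt of the CDS
cds_end = "AATAGGTTACTGTAG"                   # last 15 nt of the CDS

print(translate(cds_start))  # MSYAAYKMMH
print(translate(cds_end))    # NRLL
```

Both fragments agree with the start (`MSYAAYKMMH`) and the end (`NRLL`) of the Protein line, and the final codon `TAG` is a stop codon.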